Similarity between term senses in a lexical network
نویسندگان
چکیده
Typed lexical relations between terms are indispensable for the tasks realized in NLP, but collecting lexical information is a difficult process. Indeed, when done manually, it requires the competence of experts and the duration can be prohibitive. When done automatically, the results can be biased by the chosen corpus of texts. The approach we present here consists in having people take part in a collective project by offering them a playful application accessible on the web. From an already existing base of terms, the players themselves thus build the lexical network, by supplying associations which are validated only by an agreeing pair of users. Furthermore, these typed relations are weighted according to the number of pairs of users who provide them. We then approach the question of word usage determination for a term, by searching relations between this term and its neighbours in the network, before introducing the notion of similarity between these different word usages. We are thus able to build the tree of word usages for a term. Finally, we briefly present the realization and the first obtained results. MOTS-CLÉS : traitement automatique du langage naturel, réseau lexical, relations typées pondérées, sens d’usage d’un terme, jeu en ligne.
منابع مشابه
A Robust Approach to Aligning Heterogeneous Lexical Resources
Lexical resource alignment has been an active field of research over the last decade. However, prior methods for aligning lexical resources have been either specific to a particular pair of resources, or heavily dependent on the availability of hand-crafted alignment data for the pair of resources to be aligned. Here we present a unified approach that can be applied to an arbitrary pair of lexi...
متن کاملDiscriminative Ability of WordNet Senses on the Task of Detecting Lexical Functions in Spanish Verb Noun Collocations
Collocations, or restricted lexical co-occurrence, are a difficult issue in natural language processing because their semantics cannot be derived from the semantics of their constituents. Therefore, such verb-noun combinations as “take a break,” “catch a bus,” “have lunch” can be interpreted incorrectly by automatic semantic analysis. Since collocations are combinations frequently used in texts...
متن کاملAligning Verb Senses in Two Italian Lexical Semantic Resources
This work describes the evaluations of three different approaches, Lexical Match, Sense Similarity based on Personalized Page Rank, and Semantic Match based on Shallow Frame Structures, for word sense alignment of verbs between two Italian lexical-semantic resources, MultiWordNet and the Senso Comune Lexicon. The results obtained are quite satisfying with a final F1 score of 0.47 when merging t...
متن کاملMeasuring the Impact of Sense Similarity on Word Sense Induction
Word Sense Induction (WSI) is an unsupervised learning approach to discovering the different senses of a word from its contextual uses. A core challenge to WSI approaches is distinguishing between related and possibly similar senses of a word. Current WSI evaluation techniques have yet to analyze the specific impact of similarity on accuracy. Therefore, we present a new WSI evaluation that quan...
متن کاملCombining Word Embedding and Lexical Database for Semantic Relatedness Measurement
While many traditional studies on semantic relatedness utilize the lexical databases, such as WordNet or Wikitionary, the recent word embedding learning approaches demonstrate their abilities to capture syntactic and semantic information, and outperform the lexicon-based methods. However, word senses are not disambiguated in the training phase of both Word2Vec and GloVe, two famous word embeddi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- TAL
دوره 50 شماره
صفحات -
تاریخ انتشار 2009